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Abstract 

The formal derivation of Langevin equations (and, equivalently Fokker-Planck equa¬ 
tions) with projection operator techniques of Mori, Zwanzig, Kawasaki and others ap¬ 
parently not has widely found its way into textbooks. It has been reproduced dozens 
of times on the fly with many references to the literature and without adding much 
substantially new. Here we follow the tradition, but strive to produce a self-contained 
text. Furthermore, we address questions that naturally arise in the derivation. Among 
other things the meaning of the divergence of the Poisson brackets is explained, and 
the role of nonlinear damping coefficients is clarified. The derivation relies on classi¬ 
cal mechanics, and encompasses everything one can construct from point particles and 
potentials; solids, liquids, liquid crystals, conductors, polymers, systems with spin-like 
degrees of freedom ... Einstein relations and Onsager reciprocity relations come for free. 


1 Notation 


Coordinates in phase space are denoted by F = g„}, where p are the momenta and q the 

coordinates. A Hamiltonian H (F) dehnes a flow in phase space according to 

dV/dt = [F, H] = iLF, 

where [V, V] = y) ~ Poisson bracket and L = i [H, ...] is the Liouville 

operator. The expression 

r(F,t) = e*"F 

is a function in phase space mapping every point F to the value of the point a time step 
t in the future. A simple example is H = {p‘^ + q‘^)/2 with F(F,t) = {p{t),q(t)) = 
{pcost — qsint, qcost + psint). More generally A(F,t) = (F) is the value of an ar¬ 

bitrary function A (F) at time t when the system is in state F at t = 0. The density of the 
phase space points of an ensemble is dehned as 

p(F,f) = (J(F-F(Fo,t)))p = (jj5(Pn-Pn(t))5(gn-gn(t))) , 

° / {Pn{0),qn{0)} 

where the angular bracket denotes the ensemble average. The time derivative of p at a given 
point F is 


A 

m 


g{T,t) 


dp 

dp 


i-p) + 



dp dH dp dH 
dp dq dq dp 


[H, p] = -iLp. 
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A formal solution of this equation is = e *^V(r,0). Of particular interest is the 

probability 


p{a,t) = J dT6 {a- A{T))g{T,t) = J dT6 (a - A (h, t)) ^ (h) 

to observe values a = {a*} of observables A = {Aj} at time t when the ensemble is in a 
state with phase space density ^?(^) at time t = 0. The eqnilibrium distribntion fnnction of 
observables is written in the form 

Po (a) da = const x (1) 

where -ffe// («) is called effective Hamiltonian. 


2 Projection Operator 

The Zwanzig projection[l] operator operates on fnnctions in the 6-iV-dimensional phase space 
of N point partices and projects onto the linear subspace of ’’slow” phase space functions. 
It was introduced by R. Zwanzig to derive a generic master equation. It is mostly used in 
this or similar context in a formal way to derive eqnations of motion for some ” slow” collec¬ 
tive variables. A special subset of these functions is an ennmerable set of ’’slow variables” 
A(r) = {A„(r)}. Candidates for some of these variables might be the long-wavelenght 
Fourier components of conserved quantities like energy, momentum and mass density. Relax¬ 
ation of snch qnantities requires a transport of the conserved qnantity over the distance of a 
wavelength, and thus is much slower than the relaxation of local qnantities, which typically 
occurs within a few collision times. Another example is the order parameter of a second 
order phase transition, the dynamics of which slows down at the critical point. The Zwanzig 
projection operator relies on these functions bnt doesn’t tell how to hnd the slow variables 
of a given Hamiltonian H (T). 

2.1 Slow variables and scalar product 

A scalar product between two phase space fnnctions /i (T) and /2 (T) is dehned by the 
eqnilibrinm correlation function[3] 

(/iA)^ = j drp„ (r) h (r) h (r), (2) 

where Po (r) = is the canonical equilibrinm distribntion. ’’Fast” variables, by 

dehnition, are orthogonal to all fnnctions G (A (F)) of A (F) nnder this scalar product. This 
dehnition states that fluctuations of fast and slow variables are uncorrelated, and according 
to the ergodic hypothesis this also is trne for time averages. If a generic function /(F) is 
correlated with some slow variables, then one can snbtract functions of slow variables until 
there remains the uncorrelated fast part of /(F). The dehnition also means that the average 
of a fast variable vanishes. The product of a slow and a fast variable is a fast variable. 

The concise notation 

(/)^ = j dr'p„ (C) / (C) 

which completely hides the integration variables is used systematically below. 
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2.2 The projection operator 

Consider the continuous set of functions <ha(yl(r)) = (5(A(r)—a) = <5 (F) — a„) 

with a = {ttn} constant. Any phase space function G(A(r)) depending on F only through 
A (F) is a function of the <ha, namely 


G(A(F)) = 


daG (a) 5 (A (F) — a). 


A generic phase space function / (F) decomposes according to 

/(F) = F(A(F)) + i?(F), 

where R{T) is the fast part of / (F). To get an expression for the slow part F (A (F)) of / 
take the scalar product (2) with the slow function 5 (A (F) — a), 


{6{A-a)f), = {6{A-a)F{A)), = F{a){6{A-a)),. 


This gives an expression for F{a), and thus for the operator P projecting an arbitrary 
function /(F) to its ’’slow” part depending on F only through A (F), 


P-/(F) = F(A(F)) 


(^(A-A(F))/)p 
(5(A-A(F)))o • 


(3) 


This expression agrees with the expression given by Zwanzig[l], except that Zwanzig uses 
a microcanonical ensemble and subsumes H (F) under the slow variables.^ The Zwanzig 
projection operator fulhlls P ■ G {A (T)) = G (A (F)) and = P. The fast part of /(F) is 
(1 — P) / (F). Functions of slow variables and in particular products of slow variables are 
slow variables. The space of slow variables thus is an algebra. The algebra in general is not 
closed under the Poisson bracket, including the Poisson bracket with the Hamiltonian. The 
projection operator (3) can also be written in the form 




(4) 


where po (a) = (d (a — A))q is the equilibrium distribution (1) of A. 


3 Exact generalized Langevin Equations 

The starting point for the standard derivation of a Langevin equation is the identity 1 = 
P + Q, where Q projects onto the fast subspace. It is important in the following that the 
Liouville operator normally doesn’t commute with the Zwanzig projection operator - time 
evolution mixes fast and slow variables, [L, P] ^ 0. 

Consider discrete small time steps r with evolution operator ~ 1 + irL, where L is the 
Liouville operator. The goal is to express in terms of U^P and Q {UQ)^. The motivation 
is that U^P is a functional of slow variables and that Q {UQ)^ generates expressions which 
are fast variables at every time step. The expectation is that fast variables isolated in this 
way can be represented by some model data, for instance a Gaussian white noise. The 

heat bath in contact with the boundary of the system is irrelevant for the dynamics in the inner parts 
of the system. The only difference between canonical and microcanonical ensemble is that the microcanonical 
ensemble doesn’t have the k = 0 degree of freedom of the energy density. 
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decomposition is achieved by multiplying 1 = P + Q from the left with f/, except for the last 
term, which is multiplied with U = PU + QU. Iteration gives 


1 = P + Q, 

U = UP + PUQ + QUQ, 

= u^p+ UPUQ + PUQUQ+ QUQUQ, 

n 

pn ^ U^p+J2u^~'^P{UQ)^+ Q{UQ)^. 

m=l 


The last line can also be proved by induction.^ Now assume U = 1 + itL/n and perform the 
limit n —)■ cxD. This directly leads to the operator identity of Kawasaki [2] 

ptL = p ■ f ^^p{t-s)Lpj^QpsLQ p QptLQ _ 


A generalized exact Langevin equation is obtained by applying this equation to dA (T, t) /dt = 
{dA {T,t) /dt)^^Q, 


d 

dt 


A(r,t) = V + K + R, 


V{T,t) = e**^PA(r,0), 

K{r,t) = if (T, 0) = f / dse*(*-")^PLP (T, s), 

Jo Jo 

R{T,t) = QP^^^A{r,0). 


(5) 


The contribution R is the fluctuating force. The K term is the damping, it simplihes with 
the expression for the fluctuating force. The contribution V is called mode coupling. It 
corresponds to reversible modes like spin or plasma waves. Mode coupling term V and 
damping term K are functionals of A (T, t) and A (T, f — s) respectively and can be simplihed 
considerably. 


3.1 Mode Coupling 


Inserting the expression (4) for the Zwanzig projection into the K-term from eq.(5) gives 

K,(r,f) = e“P[A,P]= [ J^S{a-A{r,t))([A,H]S{a-A)) . (6) 

J Po yd) \ / 0 

The expectation value (...)q simplihes with [Aj, p[]po = — (1//9) [Aj, po] and partial integration 
to 


(■■■>o(«) = f dT'\A„po]S(a-A(T'))^p{\S(a-A},A!i) 


-1 

T 

-1 / d5{a- A) 

^ dA, 


/S 

-1 

T 


3 3 


A=a 


^The factor UQ is rather formal. It means projecting away the slow part of the observable after each time 
step. Hamiltonian and the trajectory of the system point in phase space are unchanged. 
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Inserting (...)g into (6) and writing po (®) = const x e finally gives 




A=A{r,t) 




dH, 


eff 


dAi 


A{r,t) 


+-Dj, 


d 




yl=yl(r,t) 


( 7 ) 


The mode coupling (7) is a functional of A (T, t), and the Langevin equation (5) with only V 
on the r.h.s. agrees with the classical (reversible) equation of motion of variables A except 
for the purely kinematic divergence D{A). In fact, D actually vanishes in most standard 
cases and it is reasonable to suggest to use variables A for which D = 0. The meaning of 
this condition is explained in detail in the appendix. 


3.2 Damping term 

Inserting the expression (4) for the projection operator into the damping term Ki from eq.(5) 
gives 


Ki = 


ds I —{A {t — s) — a) (^5 (a — A) iLRi (s)^ 


Po fa 


The Liouville operator in the expectation value (...)o can be moved to the (5-function by 
means of a partial integration, which gives the negative time derivative of the delta function, 

d ^d5 (a — A) • *7 , , • 

--IHa-A)^-Y^ ' ’ Ai^Y^—S{a-A)A, 


dt 


and thus 


Ki = [ ds [ , (5 {A {t — s) — a) (s (a — A) Aj (0) Ri (s) 

LliCL'j 


Po fa 


The product of the slow part of Aj and S {a — A) is a slow variable and doesn’t contribute 
to (...)o- It therefore can be replaced with QAj (0) = Rj (0) from eq.(5). There results the 
exact expression 

77* (r,t) = V / ds( ^ ^ Po (a) (a, s)] , (8) 

^Jo \po{a)daj J a=A(t-s) 

where 

Ay*(a,s) = — ^-—U{a-A)Rj{t))Ri{s)) = (0) i?* (s); a) (9) 

are the correlation functions of the fluctuating forces restricted to a subspace with given values 
a for the slow variables. K (T, t) in effect is a functional of A [t — s) and j {A {t — s) ,s). 
The exact evolution A{t — s) of A over the time interval [s, t] here is uninteresting, the 
nontrivial part is the evolution R{s) of the fluctuating force R over the time interval [0,s]. 
If the variables A change much more slowly than R then one can think of Aj^i [a, s) as a time 
average in a situation where the variables A have the value a. 
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3.3 Fluctuating force 

The fluctuating force R(t) = R(T,t) is a fast variable according to definition (5) at every 
time t and therefore {R^t))^ = 0. It also is easy to verify that i? is a stationary process: the 
correlation function 

(s) R, {t + (0) ■ (0))^ 

= (i?, (0) ■ QR^^^Rj (0))^ = l^Ri (0) R, it))^ 

is invariant under an arbitrary time shift s. The proof only requires the fairly trivial rules 
= Q) {^Q^)o ~ iiQ^) ^)o “ ii^^) ^)o move operators from a factor 

X to a factor Y. 

Time inversion symmetry then shows that the correlation function is a symmetric matrix^ 

Xi,j (i) = (Ri (0) Rj (t)^^ = (^Ri (0) Rj = (Ri (i) Rj (0))^ = Xj,i (i) ■ 

Because i? is a fast variable Xtj {t) decays relatively fast, in a time of order r. But it is not 
necessarily true that this decay can be described by a process independent of A. The variables 
A are qnasi constant in a time interval r, and the fast variables effectively are restricted to a 
smaller phase space. A more detailed eqnation which takes this dependence into acconnt is 

Xi,j{t) = j dapo{a)Aij{a,t), 

Ai,j (a, t) = —(a - A) Ri (0) Rj (t)\ = IRi (0) Rj {t ); a) , 

where A is a correlation fnnction in which H and A are constant. The interesting aspect is 
that this correlation is identical with the correlation fnnction in the damping term (8). 


4 Markov approximation 


Fnrther simplifications are possible if the slow variables really are slow in comparison to the 
fast variables. In that case it snggests itself to replace the unknown fast variables R{TR) 
with a Gaussian white noise. The memory effects in the damping (8) disappear and there 
results the Langevin equation 


dAj 

dt 




dH, 


eff 


dAj 




(A) ^ + ksT ^ 


dAj 


dA^ 


+ n (t ), 


( 10 ) 


(r, (0) r, it) ■ a) = 2kBT\,, (A) 5 (t). (11) 

This is the final resnlt. It contains Onsager reciprocity Ajj = Xjj and the Einstein relations 
(11). A special featnre of this Langevin equation is the extra term proportional to ksT on 
the r.h.s. The formal derivation explicitly allows damping coefficients depending on the slow 
variables A. The extra term is a conseqnence. This nnconventional term is of order O (A°) 

^As usual, if external magnetic fields are important then time inversion symmetry also requires to invert 
the external currents generating the magnetic fields. 
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for a linear dependence of A on A and appears to be large, bnt this is misleading. As can be 
seen from 

\ j J j 

the extra term cannot be seen in isolation and is reqnired to have (dA/dt) = 0. Fnrthermore, 
as can be seen from 



n 


E 


A 


dH, 


eff 


dAi 


-knT 


n 


E 


dattj- — Pq 

dttj 


a) = kuT 


the extra term actnally is small in comparison to the second term if the A-dependence of A is 
small. In that case it thns is justihed to use a conventional Langevin equation with constant 
damping coefficients and without the extra term. 

It also is of interest that the dependence of the damping coefficients on the slow variables in 
a way is trivial at the level of the Langevin equation. Constant coefficients (A) transform 
to non-constant coefficients (B) in the Langevin equation for variables B which are in 
(nonlinear) bijection with A. However, the transformation law of the extra term is rather 
complicated, see appendix. 


5 Summary and perspective 

This hopefully sufficiently self-contained article has reproduced the derivation of generalized 
nonlinear Langevin equations from classical statistical mechanics. Questions addressed are 
the stationarity of the fluctuating forces and the relevance of unconventional extra terms 
caused by the divergence of Poisson brackets and nonlinear damping coefficients. 

In general the formalism doesn’t leave much to be desired. A possible caveat is the 
division of the degrees of freedom according to the category fast and slow. There rarely is 
a clear-cut limit and the division is more or less arbitrary. Repeating the derivation with 
a shifted limit generates a Langevin equation with modihed parameters. This is equivalent 
to the renormalization group, and the question is whether the formalism generates a simple 
Langevin equation of dehnite form in the end. In the case of critical phenomena this dehnitely 
is the case, the hxed points of the renormalization group are of the Langevin equation type, 
albeit with nontrivial coefficients. The formalism, however, in principle works with any 
projection operator, but interpretation and application of the generalized Langevin equations 
then may be difficult. 


Appendix 

Divergence of Poisson brackets 

The mode coupling (7) contains a term reproducing the classical mechanics of A with a 
Hamiltonian i7e//(A) but also an additional purely kinematic term containing the Poisson 
bracket divergence 

Di(T,t) = -ksTY.'^PlAA,]. 

The meaning of this extra contribution can be elucidated by repeating the derivation of the 
mode coupling term for slow variables B which are in bijection with A. The set of slow 
variables effectively is the same and the projection operator is unchanged.^ The new aspect 

special case of this is P = 1 where A and B are in bijection with T, without any fast variables. 
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is that now the probability distribution 


{B) = const X exp (B)'j = po (A) \dA/dB\ (12) 

of the variables B can be related to the probability distribution of the variables A. This 
means that H^ff (A) and H^fj (B) cannot agree if the Jacobian isn’t constant. 

Theorem: If A and B are in bijection then the following three statements are equivalent: 


V, 


d 


d 


.4,1=0 « Vo,.,, 5;;^|G,A] = 0 « 


dAi 


dB^ 


dA 


dB 


= 0 . 


The connection between hrst and second statement is trivial. The geometric meaning is 
that every G (A) generates a divergenceless Hamiltonian flow in H-space. The hrst statement 
implies that the mode coupling for A is of standard form {D = 0) and that the mode coupling 
for B also can be written in a simple form, 


dAi/dt = [Hj, Aj] dHeff (H) /dAj, (13) 

j 

dBi/dt = 

j 


On the other hand, the formal derivation of an equation of motion for B as above gives the 
mode coupling 


dBi/dt 



1 ^ 




dH,ff (.1 (B)) 
dBj 



dA 

dB 


where (12) was used. Comparison with the mode coupling (13) deduced from the equation 
of motion for A proves the third statement. Equivalence with the hrst statement follows by 
symmetry. 

The meaning of all this is that the unconventional extra term D is associated with the 
Jacobian of the variable transformation. If the Jacobian is constant and D (H) = 0 then 
also D (B) = 0. In other words, the extra term arises when the exponent of the probability 
distribution isn’t the energy. 

The fact that H = 0 in standard cases can be understood from another point of view. 
Often the variables A directly are related to the generators of a Lie algebra. For example, 
for Heisenberg ferromagnets (model ’J’ of critical dynamics) A = {mi, m 2 , ms}, where the 
nii are the components of the magnetization density. The Poisson brackets then generate the 
Lie algebra (decorated with a J-function) 

[Hi (x), Aj (x')] = h^Am {x) J (x - x'), 


where the structure constants h are a matrix representation of the algebra (the adjoint 
representation). According to a theorem of group theory real representations of hnite and 
compact groups are equivalent to orthogonal representations and thus have vanishing trace. 
This in part explains why standard Langevin equations don’t have the unconventional extra 
contribution. 



Variable transformations in Langevin equations 


Under a bijection of variables A -H- B one has 


. ^ dBi 

AR = > —^AAr 


1 


dPB, 


dAr, 


2 dAmdA. 


-AAfnAAn + .... 


Dividing by a time interval At and inserting the Langevin equation (10) leads to the Langevin 
equation for 5, 

dBi ^ dBi / dAm\ , 7 " d^Bi 
H ~ ^ dA^ ■ ^ 2 ^ dA^dAn^' 

A word of caution is in order here: this is not standard analysis! B has to be expanded 
to second order in A because the product of two fluctuating forces from the r.h.s. of the 
Langevin equation for A is a delta function, see eq.(lO). 

This should agree with the Langevin equation for B ala eq.(lO) derived directly. There 
follow the identihcations 




E 


(A) 

dA^ ^ 


it), 


E 

m,n 


dB'i dBj ^ 


The transformation law of the extra term can be found with the divergence rule 


E 


d 

dBj 



dA 

d dAn (B) 

dB 

dB 

2-^ dAn dBj 

dA 


according to 


E 



dBj 


dA 

d dAn .(B) 

dB 

dB 

2^ dAn dBj 

dA 


dA 

\ ^ dBi 

dB 

dB 

^ dAn dA^""’" 

dA 


y 


dBi f d\\m}n . d dB 
dxi \ dAn ^ dA 


+ E 

m,n 


^ \(A) 

dA^dA^ 


Even if the original damping coeffcients are constant then a nonlinear variable transfor¬ 
mation generates non-constant damping coefficients However, slow variables normally 
have a certain meaning (conserved quantities, order parameter, ...), and a nonlinear trans¬ 
formation thus is rather formal. 
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